Modelling test day data
نویسندگان
چکیده
Four statistical models for genetic evaluations utilising dairy test day data are considered. These are: a fixed regression model, a random regression model, an autoregressive model and a multiple trait model. The emphasis is put on the comparison of these models in terms of their assumed covariance structure, modelling and prediction of breeding values and parameterisation. In the future one of the models should be used for a routine genetic evaluation of the Polish Black-and-White dairy cattle. Therefore, characteristics of test day data from the Polish population are given. In conclusion, it appears that thanks to its flexibility in handling heterogeneous variances during lactation, variable autocorrelation, and nonuniform spacing between tests, random regression model forms the most suitable approach. Key-words: dairy cattle, genetic evaluation, statistical modelling , test day data Introduction Longitudinal data consist of sets of multiple observations scored on one subject (e.g., a person or an animal) repeatedly in time. Milk production measured on cows on several test days during lactation forms a classical example of longitudinal data. From the statistical perspective, the most important feature of the analysis of such repeated observations is the ability to model the correlation between them. During the last two decades appropriate statistical methods have been developed and applied mainly in the analysis of small, experimental data sets (for overview see: DIGGLE et al. 1994). Thanks to the progress in computer hardware technology, the analysis of large amounts of data has become feasible also for modelling of test day data in dairy cattle (PTAK, SCHAEFFER 1993, SCHAEFFER, DEKKERS 1994, JAMROZIK et al. 1997). Moreover, attempts have been made recently to use test day information in national genetic evaluations for dairy cattle (REENTS et al. 1995, JAMROZIK et al. 1997). The main objective of this paper is to compare four models available for genetic evaluation based on dairy test day yields. The model which suites best the characteristics of test day data from the dairy population of the Polish Black-and-White cattle is chosen, as a desirable model for a routine genetic evaluation. Genetic evaluation models for dairy test day data Modelling of test day data Test day data sets resulting from routine milk recording programs are considerably larger and have a more complicated covariance structure than data from designed experiments. The most characteristic features of test day data comprise: (i) between-subject correlation, (ii) differences between means and variances of measurements taken at different stages of lactation, (iii) correlation between test day yields decaying with increasing time between test days, meaning that outcomes of adjacent tests are more correlated than outcomes of remote tests, (iv) unequal spacing of test days throughout lactation. In the modelling of test day data the aforementioned features have to be accounted for. Traditionally, cow milk production measured on consecutive test days has been transformed into an aggregated 305-day yield. Such assembling of data on the phenotypic level has many disadvantages, such as the need for projecting of partially terminated records or inability to model changes in environment throughout the lactation. In order to avoid using the aggregated 305-day lactation records, various approaches to modelling of test day data directly have been developed. These are: (i) fixed and random regression models (PTAK, SCHAEFFER 1993, SCHAEFFER, DEKKERS 1994, REENTS et al. 1995, JAMROZIK et al. 1997, MEYER 1997), (ii) first-order autoregressive models (CARVALHEIRA et al. 1998), (iii) multiple trait models (WIGGANS, GODDARD 1997; GENGLER et al. 1999). The most important characteristics of these models are outlined below. Fixed regression model The fixed regression model (FRM) was the first model proposed for modelling of dairy test day data (PTAK, SCHAEFFER 1993, REENTS et al. 1995): ∑ = + + + + = + + + + =
منابع مشابه
Vector Autoregressive Model Selection: Gross Domestic Product and Europe Oil Prices Data Modelling
We consider the problem of model selection in vector autoregressive model with Normal innovation. Tests such as Vuong's and Cox's tests are provided for order and model selection, i.e. for selecting the order and a suitable subset of regressors, in vector autoregressive model. We propose a test as a modified log-likelihood ratio test for selecting subsets of regressors. The Europe oil prices, ...
متن کاملMetabolomic Profiling of Submaximal Exercise at a Standardised Relative Intensity in Healthy Adults
Ten physically active subjects underwent two cycling exercise trials. In the first, aerobic capacity (VO2max) was determined and the second was a 45 min submaximal exercise test. Urine samples were collected separately the day before (day 1) , the day of (day 2), and the day after (day 3) the submaximal exercise test (12 samples per subject). Metabolomic profiling of the samples was carried out...
متن کاملDaily Pan Evaporation Modelling With ANFIS and NNARX
Evaporation, as a major component of the hydrologic cycle, plays a key role in water resources development and management in arid and semi-arid climatic regions. Although there are empirical formulas available, their performances are not all satisfactory due to the complicated nature of the evaporation process and the data availability. This paper explores evaporation estimation methods based o...
متن کاملAutomatically Generating Roof Models from Building Footprints
Modelling Large Urban Environments using traditional modelling techniques would prove too time consuming a process. Consequently a method is required for generating large urban environments quickly without requiring a labour intensive process or expensive data. Some approaches to resolve this problem have been undertaken through the use of procedural modelling or statistics. However the test su...
متن کاملThe Factor Structure of a Written English Proficiency Test: A Structural Equation Modeling Approach
The present study examined the factor structure of the University of Tehran English Proficiency Test (UTEPT) that aims to examine test takers’ knowledge of grammar, vocabulary, and reading comprehension. A Structural Equation Modelling (SEM) approach was used to analyse the responses of participants (N= 850) to a 2010 version of the test. A higher-order model was postulated to test if the unde...
متن کامل